Dataset statistics
| Number of variables | 30 |
|---|---|
| Number of observations | 29965 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 5.3 MiB |
| Average record size in memory | 184.0 B |
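
The average record size can be cross-checked against the variable counts in this report. The sketch below assumes storage widths the report does not state: 8 bytes per numeric value, 1 byte per boolean, and 8 bytes for the categorical column (an object pointer). Under those assumptions it reproduces both the 184.0 B record size and the 5.3 MiB total.

```python
# Reported in the overview: 29965 observations; 21 NUM, 8 BOOL, 1 CAT variables.
N_ROWS = 29965
COLUMN_COUNTS = {"NUM": 21, "BOOL": 8, "CAT": 1}

# ASSUMPTION (not stated in the report): storage width per value, in bytes.
# 8 B for 64-bit numerics, 1 B for booleans, 8 B for an object pointer.
BYTES_PER_VALUE = {"NUM": 8, "BOOL": 1, "CAT": 8}


def record_size_bytes(counts, widths):
    """Bytes needed to store one row, summed over all columns."""
    return sum(counts[kind] * widths[kind] for kind in counts)


def to_mib(n_bytes):
    """Convert bytes to mebibytes (1 MiB = 2**20 bytes)."""
    return n_bytes / 2**20


record_bytes = record_size_bytes(COLUMN_COUNTS, BYTES_PER_VALUE)
total_mib = to_mib(record_bytes * N_ROWS)
print(record_bytes, round(total_mib, 1))
```

The same widths are consistent with the per-variable memory sizes further down: 29965 × 8 B ≈ 234.1 KiB for numeric columns and 29965 × 1 B ≈ 29.3 KiB for boolean columns.
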
Variable types
| NUM | 21 |
|---|---|
| BOOL | 8 |
| CAT | 1 |
Warnings
| Warning | Type |
|---|---|
| BILL_AMT_AUG05 is highly correlated with BILL_AMT_SEP05 and 1 other field | High correlation |
| BILL_AMT_SEP05 is highly correlated with BILL_AMT_AUG05 | High correlation |
| BILL_AMT_JUL05 is highly correlated with BILL_AMT_AUG05 and 1 other field | High correlation |
| BILL_AMT_JUN05 is highly correlated with BILL_AMT_JUL05 and 2 other fields | High correlation |
| BILL_AMT_MAY05 is highly correlated with BILL_AMT_JUN05 and 1 other field | High correlation |
| BILL_AMT_APR05 is highly correlated with BILL_AMT_JUN05 and 1 other field | High correlation |
| GENDER_male is highly correlated with GENDER_female | High correlation |
| GENDER_female is highly correlated with GENDER_male | High correlation |
| DEFAULT_not default is highly correlated with DEFAULT_default | High correlation |
| DEFAULT_default is highly correlated with DEFAULT_not default | High correlation |
| PAY_AMT_AUG05 is highly skewed (γ1 = 30.43861292) | Skewed |
| df_index has unique values | Unique |
| REPAY_SEP05 has 14737 (49.2%) zeros | Zeros |
| REPAY_AUG05 has 15730 (52.5%) zeros | Zeros |
| REPAY_JUL05 has 15764 (52.6%) zeros | Zeros |
| REPAY_JUN05 has 16455 (54.9%) zeros | Zeros |
| REPAY_MAY05 has 16947 (56.6%) zeros | Zeros |
| REPAY_APR05 has 16286 (54.4%) zeros | Zeros |
| BILL_AMT_SEP05 has 1978 (6.6%) zeros | Zeros |
| BILL_AMT_AUG05 has 2476 (8.3%) zeros | Zeros |
| BILL_AMT_JUL05 has 2840 (9.5%) zeros | Zeros |
| BILL_AMT_JUN05 has 3165 (10.6%) zeros | Zeros |
| BILL_AMT_MAY05 has 3476 (11.6%) zeros | Zeros |
| BILL_AMT_APR05 has 3990 (13.3%) zeros | Zeros |
| PAY_AMT_SEP05 has 5218 (17.4%) zeros | Zeros |
| PAY_AMT_AUG05 has 5365 (17.9%) zeros | Zeros |
| PAY_AMT_JUL05 has 5937 (19.8%) zeros | Zeros |
| PAY_AMT_JUN05 has 6377 (21.3%) zeros | Zeros |
| PAY_AMT_MAY05 has 6672 (22.3%) zeros | Zeros |
| PAY_AMT_APR05 has 7142 (23.8%) zeros | Zeros |
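
The γ1 value in the skewness warning is a standardized third moment. The report does not say which estimator it uses, so the sketch below assumes the bias-adjusted sample skewness (the form pandas' `Series.skew` computes). The function name and the sample data are made up for illustration.

```python
import math


def adjusted_skewness(values):
    """Bias-adjusted sample skewness (assumed estimator, see lead-in)."""
    n = len(values)
    if n < 3:
        raise ValueError("need at least three observations")
    mean = math.fsum(values) / n
    # Second and third central moments (population form, divided by n).
    m2 = math.fsum((x - mean) ** 2 for x in values) / n
    m3 = math.fsum((x - mean) ** 3 for x in values) / n
    if m2 == 0:
        return 0.0  # constant data: no asymmetry to measure
    g1 = m3 / m2 ** 1.5
    # Small-sample correction factor applied to the moment ratio g1.
    return math.sqrt(n * (n - 1)) / (n - 2) * g1


# Made-up data: mostly zeros plus one large payment gives a long right tail.
right_tailed = [0, 0, 0, 10]
symmetric = [1, 2, 3, 4, 5]
print(adjusted_skewness(right_tailed), adjusted_skewness(symmetric))
```

A payment column with many zeros and a few very large values, as in the warnings above, pushes this statistic far above zero.
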
Reproduction
| Analysis started | 2021-01-12 20:37:25.369004 |
|---|---|
| Analysis finished | 2021-01-12 20:38:55.851523 |
| Duration | 1 minute and 30.48 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
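
As a quick consistency check, the reported duration follows directly from the two timestamps. This is a minimal sketch using only the standard library; the variable names are illustrative.

```python
from datetime import datetime

# Timestamps copied from the Reproduction table.
started = datetime.fromisoformat("2021-01-12 20:37:25.369004")
finished = datetime.fromisoformat("2021-01-12 20:38:55.851523")

# Split the elapsed time into whole minutes and remaining seconds.
elapsed = (finished - started).total_seconds()
minutes, seconds = divmod(elapsed, 60)
print(f"{int(minutes)} minute(s) and {seconds:.2f} seconds")
```
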
df_index
Real number (ℝ≥0)
| Distinct | 29965 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14993.93252 |
|---|---|
| Minimum | 0 |
| Maximum | 29999 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 234.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1498.2 |
| Q1 | 7496 |
| median | 14991 |
| Q3 | 22493 |
| 95-th percentile | 28495.8 |
| Maximum | 29999 |
| Range | 29999 |
| Interquartile range (IQR) | 14997 |
Descriptive statistics
| Standard deviation | 8659.328323 |
|---|---|
| Coefficient of variation (CV) | 0.5775221618 |
| Kurtosis | -1.199841197 |
| Mean | 14993.93252 |
| Median Absolute Deviation (MAD) | 7499 |
| Skewness | 0.0005462130641 |
| Sum | 449293188 |
| Variance | 74983967.01 |
| Monotonicity | Strictly increasing |
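
Several of the figures above are derived from others in the same block, which gives a cheap way to sanity-check a profile. The sketch below recomputes range, IQR, variance and sum from the reported minimum, quartiles, mean and standard deviation. Small differences are expected because the inputs are rounded.

```python
# Values copied from the tables above (the unique index variable).
n = 29965
mean, std = 14993.93252, 8659.328323
minimum, q1, q3, maximum = 0, 7496, 22493, 29999

value_range = maximum - minimum   # reported: 29999
iqr = q3 - q1                     # reported: 14997
variance = std ** 2               # reported: 74983967.01
total = mean * n                  # reported: 449293188

print(value_range, iqr, round(variance, 2), round(total))
```
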
| Value | Count | Frequency (%) | |
| 2047 | 1 | < 0.1% | |
| 1322 | 1 | < 0.1% | |
| 15629 | 1 | < 0.1% | |
| 9486 | 1 | < 0.1% | |
| 11535 | 1 | < 0.1% | |
| 21792 | 1 | < 0.1% | |
| 23841 | 1 | < 0.1% | |
| 17698 | 1 | < 0.1% | |
| 19747 | 1 | < 0.1% | |
| 29988 | 1 | < 0.1% | |
| Other values (29955) | 29955 | > 99.9% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 1 | 1 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 3 | 1 | < 0.1% | |
| 4 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 29999 | 1 | < 0.1% | |
| 29998 | 1 | < 0.1% | |
| 29997 | 1 | < 0.1% | |
| 29996 | 1 | < 0.1% | |
| 29995 | 1 | < 0.1% |
LIMIT_BAL
Real number (ℝ≥0)
| Distinct | 81 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 167442.005 |
|---|---|
| Minimum | 10000 |
| Maximum | 1000000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 234.1 KiB |
Quantile statistics
| Minimum | 10000 |
|---|---|
| 5-th percentile | 20000 |
| Q1 | 50000 |
| median | 140000 |
| Q3 | 240000 |
| 95-th percentile | 430000 |
| Maximum | 1000000 |
| Range | 990000 |
| Interquartile range (IQR) | 190000 |
Descriptive statistics
| Standard deviation | 129760.1352 |
|---|---|
| Coefficient of variation (CV) | 0.7749556942 |
| Kurtosis | 0.5375871217 |
| Mean | 167442.005 |
| Median Absolute Deviation (MAD) | 90000 |
| Skewness | 0.9934913272 |
| Sum | 5017399680 |
| Variance | 1.683769269e+10 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 50000 | 3363 | 11.2% | |
| 20000 | 1975 | 6.6% | |
| 30000 | 1610 | 5.4% | |
| 80000 | 1564 | 5.2% | |
| 200000 | 1524 | 5.1% | |
| 150000 | 1107 | 3.7% | |
| 100000 | 1047 | 3.5% | |
| 180000 | 993 | 3.3% | |
| 360000 | 874 | 2.9% | |
| 60000 | 825 | 2.8% | |
| Other values (71) | 15083 | 50.3% |
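
The "Other values" row is the remainder after the ten most frequent values: 81 distinct values minus the 10 shown leaves 71, and 29965 observations minus the 14882 covered by the top ten leaves 15083. Below is a minimal sketch of how such a table could be built. The helper `common_values` and the sample data are hypothetical, not taken from the profiling library.

```python
from collections import Counter


def common_values(values, k=10):
    """Return (label, count) rows: the k most frequent values plus a remainder row."""
    counts = Counter(values)
    top = counts.most_common(k)
    shown = sum(count for _, count in top)
    other_distinct = len(counts) - len(top)
    rows = [(str(value), count) for value, count in top]
    if other_distinct > 0:
        # Everything not listed individually is pooled into one row.
        rows.append((f"Other values ({other_distinct})", len(values) - shown))
    return rows


# Made-up credit limits, not the real column.
limits = [50000] * 4 + [20000] * 3 + [30000] * 2 + [80000, 10000]
rows = common_values(limits, k=3)
for label, count in rows:
    print(label, count)
```
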
| Value | Count | Frequency (%) | |
| 10000 | 493 | 1.6% | |
| 16000 | 2 | < 0.1% | |
| 20000 | 1975 | 6.6% | |
| 30000 | 1610 | 5.4% | |
| 40000 | 230 | 0.8% |
| Value | Count | Frequency (%) | |
| 1000000 | 1 | < 0.1% | |
| 800000 | 2 | < 0.1% | |
| 780000 | 2 | < 0.1% | |
| 760000 | 1 | < 0.1% | |
| 750000 | 4 | < 0.1% |
MARITAL_STATUS
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 234.1 KiB |
| Value | Count | Frequency (%) | |
| 2 | 15945 | 53.2% | |
| 1 | 13643 | 45.5% | |
| 3 | 323 | 1.1% | |
| 0 | 54 | 0.2% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
AGE
Real number (ℝ≥0)
| Distinct | 56 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 35.4879693 |
|---|---|
| Minimum | 21 |
| Maximum | 79 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 234.1 KiB |
Quantile statistics
| Minimum | 21 |
|---|---|
| 5-th percentile | 23 |
| Q1 | 28 |
| median | 34 |
| Q3 | 41 |
| 95-th percentile | 53 |
| Maximum | 79 |
| Range | 58 |
| Interquartile range (IQR) | 13 |
Descriptive statistics
| Standard deviation | 9.219459233 |
|---|---|
| Coefficient of variation (CV) | 0.2597911184 |
| Kurtosis | 0.04398801494 |
| Mean | 35.4879693 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 0.7320560019 |
| Sum | 1063397 |
| Variance | 84.99842855 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 29 | 1602 | 5.3% | |
| 27 | 1475 | 4.9% | |
| 28 | 1406 | 4.7% | |
| 30 | 1394 | 4.7% | |
| 26 | 1252 | 4.2% | |
| 31 | 1213 | 4.0% | |
| 25 | 1185 | 4.0% | |
| 34 | 1161 | 3.9% | |
| 32 | 1157 | 3.9% | |
| 33 | 1146 | 3.8% | |
| Other values (46) | 16974 | 56.6% |
| Value | Count | Frequency (%) | |
| 21 | 67 | 0.2% | |
| 22 | 560 | 1.9% | |
| 23 | 930 | 3.1% | |
| 24 | 1126 | 3.8% | |
| 25 | 1185 | 4.0% |
| Value | Count | Frequency (%) | |
| 79 | 1 | < 0.1% | |
| 75 | 3 | < 0.1% | |
| 74 | 1 | < 0.1% | |
| 73 | 4 | < 0.1% | |
| 72 | 3 | < 0.1% |
REPAY_SEP05
Real number (ℝ)
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.01675287836 |
|---|---|
| Minimum | -2 |
| Maximum | 8 |
| Zeros | 14737 |
| Zeros (%) | 49.2% |
| Memory size | 234.1 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | -2 |
| Q1 | -1 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 8 |
| Range | 10 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.123492034 |
|---|---|
| Coefficient of variation (CV) | -67.06262707 |
| Kurtosis | 2.730038381 |
| Mean | -0.01675287836 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.7346064765 |
| Sum | -502 |
| Variance | 1.26223435 |
| Monotonicity | Not monotonic |
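
The block above, whose 14737 zeros match the REPAY_SEP05 warning, reports a coefficient of variation of about -67. CV is the standard deviation divided by the mean, so it turns negative when the mean is negative and grows very large when the mean is close to zero. The sketch below recomputes it from the reported figures and contrasts it with AGE, whose mean is well away from zero. The function name is illustrative.

```python
def coefficient_of_variation(std, mean):
    """Standard deviation relative to the mean; undefined when the mean is zero."""
    if mean == 0:
        raise ZeroDivisionError("CV is undefined for a zero mean")
    return std / mean


# Figures copied from the report's descriptive statistics.
cv_repay = coefficient_of_variation(1.123492034, -0.01675287836)
cv_age = coefficient_of_variation(9.219459233, 35.4879693)
print(round(cv_repay, 2), round(cv_age, 4))
```

For variables like this one, the standard deviation or IQR is usually the more informative measure of spread.
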
| Value | Count | Frequency (%) | |
| 0 | 14737 | 49.2% | |
| -1 | 5682 | 19.0% | |
| 1 | 3667 | 12.2% | |
| -2 | 2750 | 9.2% | |
| 2 | 2666 | 8.9% | |
| 3 | 322 | 1.1% | |
| 4 | 76 | 0.3% | |
| 5 | 26 | 0.1% | |
| 8 | 19 | 0.1% | |
| 6 | 11 | < 0.1% |
| Value | Count | Frequency (%) | |
| -2 | 2750 | 9.2% | |
| -1 | 5682 | 19.0% | |
| 0 | 14737 | 49.2% | |
| 1 | 3667 | 12.2% | |
| 2 | 2666 | 8.9% |
| Value | Count | Frequency (%) | |
| 8 | 19 | 0.1% | |
| 7 | 9 | < 0.1% | |
| 6 | 11 | < 0.1% | |
| 5 | 26 | 0.1% | |
| 4 | 76 | 0.3% |
REPAY_AUG05
Real number (ℝ)
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.1318538295 |
|---|---|
| Minimum | -2 |
| Maximum | 8 |
| Zeros | 15730 |
| Zeros (%) | 52.5% |
| Memory size | 234.1 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | -2 |
| Q1 | -1 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 8 |
| Range | 10 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.196321699 |
|---|---|
| Coefficient of variation (CV) | -9.073090281 |
| Kurtosis | 1.577608705 |
| Mean | -0.1318538295 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.7920704147 |
| Sum | -3951 |
| Variance | 1.431185607 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 15730 | 52.5% | |
| -1 | 6046 | 20.2% | |
| 2 | 3926 | 13.1% | |
| -2 | 3752 | 12.5% | |
| 3 | 326 | 1.1% | |
| 4 | 99 | 0.3% | |
| 1 | 28 | 0.1% | |
| 5 | 25 | 0.1% | |
| 7 | 20 | 0.1% | |
| 6 | 12 | < 0.1% |
| Value | Count | Frequency (%) | |
| -2 | 3752 | 12.5% | |
| -1 | 6046 | 20.2% | |
| 0 | 15730 | 52.5% | |
| 1 | 28 | 0.1% | |
| 2 | 3926 | 13.1% |
| Value | Count | Frequency (%) | |
| 8 | 1 | < 0.1% | |
| 7 | 20 | 0.1% | |
| 6 | 12 | < 0.1% | |
| 5 | 25 | 0.1% | |
| 4 | 99 | 0.3% |
REPAY_JUL05
Real number (ℝ)
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.1643917904 |
|---|---|
| Minimum | -2 |
| Maximum | 8 |
| Zeros | 15764 |
| Zeros (%) | 52.6% |
| Memory size | 234.1 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | -2 |
| Q1 | -1 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 8 |
| Range | 10 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.195877509 |
|---|---|
| Coefficient of variation (CV) | -7.274557358 |
| Kurtosis | 2.091665951 |
| Mean | -0.1643917904 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.8414639808 |
| Sum | -4926 |
| Variance | 1.430123016 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 15764 | 52.6% | |
| -1 | 5934 | 19.8% | |
| -2 | 4055 | 13.5% | |
| 2 | 3819 | 12.7% | |
| 3 | 240 | 0.8% | |
| 4 | 75 | 0.3% | |
| 7 | 27 | 0.1% | |
| 6 | 23 | 0.1% | |
| 5 | 21 | 0.1% | |
| 1 | 4 | < 0.1% |
| Value | Count | Frequency (%) | |
| -2 | 4055 | 13.5% | |
| -1 | 5934 | 19.8% | |
| 0 | 15764 | 52.6% | |
| 1 | 4 | < 0.1% | |
| 2 | 3819 | 12.7% |
| Value | Count | Frequency (%) | |
| 8 | 3 | < 0.1% | |
| 7 | 27 | 0.1% | |
| 6 | 23 | 0.1% | |
| 5 | 21 | 0.1% | |
| 4 | 75 | 0.3% |
REPAY_JUN05
Real number (ℝ)
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.2189220758 |
|---|---|
| Minimum | -2 |
| Maximum | 8 |
| Zeros | 16455 |
| Zeros (%) | 54.9% |
| Memory size | 234.1 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | -2 |
| Q1 | -1 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 8 |
| Range | 10 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.168175186 |
|---|---|
| Coefficient of variation (CV) | -5.33603193 |
| Kurtosis | 3.508962108 |
| Mean | -0.2189220758 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.000798562 |
| Sum | -6560 |
| Variance | 1.364633266 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 16455 | 54.9% | |
| -1 | 5683 | 19.0% | |
| -2 | 4318 | 14.4% | |
| 2 | 3159 | 10.5% | |
| 3 | 180 | 0.6% | |
| 4 | 68 | 0.2% | |
| 7 | 58 | 0.2% | |
| 5 | 35 | 0.1% | |
| 6 | 5 | < 0.1% | |
| 8 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| -2 | 4318 | 14.4% | |
| -1 | 5683 | 19.0% | |
| 0 | 16455 | 54.9% | |
| 1 | 2 | < 0.1% | |
| 2 | 3159 | 10.5% |
| Value | Count | Frequency (%) | |
| 8 | 2 | < 0.1% | |
| 7 | 58 | 0.2% | |
| 6 | 5 | < 0.1% | |
| 5 | 35 | 0.1% | |
| 4 | 68 | 0.2% |
REPAY_MAY05
Real number (ℝ)
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.2645085934 |
|---|---|
| Minimum | -2 |
| Maximum | 8 |
| Zeros | 16947 |
| Zeros (%) | 56.6% |
| Memory size | 234.1 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | -2 |
| Q1 | -1 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 8 |
| Range | 10 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.132219856 |
|---|---|
| Coefficient of variation (CV) | -4.280465302 |
| Kurtosis | 4.003562263 |
| Mean | -0.2645085934 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.009329021 |
| Sum | -7926 |
| Variance | 1.281921802 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 16947 | 56.6% | |
| -1 | 5535 | 18.5% | |
| -2 | 4516 | 15.1% | |
| 2 | 2626 | 8.8% | |
| 3 | 178 | 0.6% | |
| 4 | 83 | 0.3% | |
| 7 | 58 | 0.2% | |
| 5 | 17 | 0.1% | |
| 6 | 4 | < 0.1% | |
| 8 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| -2 | 4516 | 15.1% | |
| -1 | 5535 | 18.5% | |
| 0 | 16947 | 56.6% | |
| 2 | 2626 | 8.8% | |
| 3 | 178 | 0.6% |
| Value | Count | Frequency (%) | |
| 8 | 1 | < 0.1% | |
| 7 | 58 | 0.2% | |
| 6 | 4 | < 0.1% | |
| 5 | 17 | 0.1% | |
| 4 | 83 | 0.3% |
REPAY_APR05
Real number (ℝ)
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.2894376773 |
|---|---|
| Minimum | -2 |
| Maximum | 8 |
| Zeros | 16286 |
| Zeros (%) | 54.4% |
| Memory size | 234.1 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | -2 |
| Q1 | -1 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 8 |
| Range | 10 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.1490901 |
|---|---|
| Coefficient of variation (CV) | -3.970077809 |
| Kurtosis | 3.437256875 |
| Mean | -0.2894376773 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.9486089933 |
| Sum | -8673 |
| Variance | 1.320408057 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 16286 | 54.4% | |
| -1 | 5736 | 19.1% | |
| -2 | 4865 | 16.2% | |
| 2 | 2766 | 9.2% | |
| 3 | 184 | 0.6% | |
| 4 | 48 | 0.2% | |
| 7 | 46 | 0.2% | |
| 6 | 19 | 0.1% | |
| 5 | 13 | < 0.1% | |
| 8 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| -2 | 4865 | 16.2% | |
| -1 | 5736 | 19.1% | |
| 0 | 16286 | 54.4% | |
| 2 | 2766 | 9.2% | |
| 3 | 184 | 0.6% |
| Value | Count | Frequency (%) | |
| 8 | 2 | < 0.1% | |
| 7 | 46 | 0.2% | |
| 6 | 19 | 0.1% | |
| 5 | 13 | < 0.1% | |
| 4 | 48 | 0.2% |
BILL_AMT_SEP05
Real number (ℝ)
| Distinct | 22723 |
|---|---|
| Distinct (%) | 75.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 51283.00978 |
|---|---|
| Minimum | -165580 |
| Maximum | 964511 |
| Zeros | 1978 |
| Zeros (%) | 6.6% |
| Memory size | 234.1 KiB |
Quantile statistics
| Minimum | -165580 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3595 |
| median | 22438 |
| Q3 | 67260 |
| 95-th percentile | 201303.8 |
| Maximum | 964511 |
| Range | 1130091 |
| Interquartile range (IQR) | 63665 |
Descriptive statistics
| Standard deviation | 73658.1324 |
|---|---|
| Coefficient of variation (CV) | 1.436306736 |
| Kurtosis | 9.796846218 |
| Mean | 51283.00978 |
| Median Absolute Deviation (MAD) | 21842 |
| Skewness | 2.662513456 |
| Sum | 1536695388 |
| Variance | 5425520469 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 1978 | 6.6% | |
| 390 | 243 | 0.8% | |
| 780 | 76 | 0.3% | |
| 326 | 72 | 0.2% | |
| 316 | 63 | 0.2% | |
| 2500 | 59 | 0.2% | |
| 396 | 48 | 0.2% | |
| 2400 | 39 | 0.1% | |
| 416 | 29 | 0.1% | |
| 1050 | 25 | 0.1% | |
| Other values (22713) | 27333 | 91.2% |
| Value | Count | Frequency (%) | |
| -165580 | 1 | < 0.1% | |
| -154973 | 1 | < 0.1% | |
| -15308 | 1 | < 0.1% | |
| -14386 | 1 | < 0.1% | |
| -11545 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 964511 | 1 | < 0.1% | |
| 746814 | 1 | < 0.1% | |
| 653062 | 1 | < 0.1% | |
| 630458 | 1 | < 0.1% | |
| 626648 | 1 | < 0.1% |
BILL_AMT_AUG05
Real number (ℝ)
| Distinct | 22346 |
|---|---|
| Distinct (%) | 74.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 49236.36629 |
|---|---|
| Minimum | -69777 |
| Maximum | 983931 |
| Zeros | 2476 |
| Zeros (%) | 8.3% |
| Memory size | 234.1 KiB |
Quantile statistics
| Minimum | -69777 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3010 |
| median | 21295 |
| Q3 | 64109 |
| 95-th percentile | 194889.6 |
| Maximum | 983931 |
| Range | 1053708 |
| Interquartile range (IQR) | 61099 |
Descriptive statistics
| Standard deviation | 71195.56739 |
|---|---|
| Coefficient of variation (CV) | 1.445995567 |
| Kurtosis | 10.29321199 |
| Mean | 49236.36629 |
| Median Absolute Deviation (MAD) | 20905 |
| Skewness | 2.70386174 |
| Sum | 1475367716 |
| Variance | 5068808816 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 2476 | 8.3% | |
| 390 | 230 | 0.8% | |
| 326 | 75 | 0.3% | |
| 780 | 75 | 0.3% | |
| 316 | 72 | 0.2% | |
| 2500 | 51 | 0.2% | |
| 396 | 50 | 0.2% | |
| 2400 | 42 | 0.1% | |
| -200 | 29 | 0.1% | |
| 416 | 28 | 0.1% | |
| Other values (22336) | 26837 | 89.6% |
| Value | Count | Frequency (%) | |
| -69777 | 1 | < 0.1% | |
| -67526 | 1 | < 0.1% | |
| -33350 | 1 | < 0.1% | |
| -30000 | 1 | < 0.1% | |
| -26214 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 983931 | 1 | < 0.1% | |
| 743970 | 1 | < 0.1% | |
| 671563 | 1 | < 0.1% | |
| 646770 | 1 | < 0.1% | |
| 624475 | 1 | < 0.1% |
BILL_AMT_JUL05
Real number (ℝ)
| Distinct | 22026 |
|---|---|
| Distinct (%) | 73.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 47067.91607 |
|---|---|
| Minimum | -157264 |
| Maximum | 1664089 |
| Zeros | 2840 |
| Zeros (%) | 9.5% |
| Memory size | 234.1 KiB |
Quantile statistics
| Minimum | -157264 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2711 |
| median | 20135 |
| Q3 | 60201 |
| 95-th percentile | 187901 |
| Maximum | 1664089 |
| Range | 1821353 |
| Interquartile range (IQR) | 57490 |
Descriptive statistics
| Standard deviation | 69371.35232 |
|---|---|
| Coefficient of variation (CV) | 1.473856464 |
| Kurtosis | 19.77100256 |
| Mean | 47067.91607 |
| Median Absolute Deviation (MAD) | 19745 |
| Skewness | 3.086493832 |
| Sum | 1410390105 |
| Variance | 4812384523 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 2840 | 9.5% | |
| 390 | 274 | 0.9% | |
| 780 | 74 | 0.2% | |
| 326 | 63 | 0.2% | |
| 316 | 62 | 0.2% | |
| 396 | 47 | 0.2% | |
| 2500 | 40 | 0.1% | |
| 2400 | 39 | 0.1% | |
| 416 | 29 | 0.1% | |
| 200 | 27 | 0.1% | |
| Other values (22016) | 26470 | 88.3% |
| Value | Count | Frequency (%) | |
| -157264 | 1 | < 0.1% | |
| -61506 | 1 | < 0.1% | |
| -46127 | 1 | < 0.1% | |
| -34041 | 1 | < 0.1% | |
| -25443 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1664089 | 1 | < 0.1% | |
| 855086 | 1 | < 0.1% | |
| 693131 | 1 | < 0.1% | |
| 689643 | 1 | < 0.1% | |
| 689627 | 1 | < 0.1% |
BILL_AMT_JUN05
Real number (ℝ)
| Distinct | 21548 |
|---|---|
| Distinct (%) | 71.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 43313.32988 |
|---|---|
| Minimum | -170000 |
| Maximum | 891586 |
| Zeros | 3165 |
| Zeros (%) | 10.6% |
| Memory size | 234.1 KiB |
Quantile statistics
| Minimum | -170000 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2360 |
| median | 19081 |
| Q3 | 54601 |
| 95-th percentile | 174469.8 |
| Maximum | 891586 |
| Range | 1061586 |
| Interquartile range (IQR) | 52241 |
Descriptive statistics
| Standard deviation | 64353.51437 |
|---|---|
| Coefficient of variation (CV) | 1.485766958 |
| Kurtosis | 11.29858229 |
| Mean | 43313.32988 |
| Median Absolute Deviation (MAD) | 18681 |
| Skewness | 2.820544832 |
| Sum | 1297883930 |
| Variance | 4141374812 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 3165 | 10.6% | |
| 390 | 245 | 0.8% | |
| 780 | 101 | 0.3% | |
| 316 | 68 | 0.2% | |
| 326 | 62 | 0.2% | |
| 396 | 43 | 0.1% | |
| 150 | 39 | 0.1% | |
| 2400 | 39 | 0.1% | |
| 2500 | 34 | 0.1% | |
| 1000 | 33 | 0.1% | |
| Other values (21538) | 26136 | 87.2% |
| Value | Count | Frequency (%) | |
| -170000 | 1 | < 0.1% | |
| -81334 | 1 | < 0.1% | |
| -65167 | 1 | < 0.1% | |
| -50616 | 1 | < 0.1% | |
| -46627 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 891586 | 1 | < 0.1% | |
| 706864 | 1 | < 0.1% | |
| 628699 | 1 | < 0.1% | |
| 616836 | 1 | < 0.1% | |
| 572805 | 1 | < 0.1% |
BILL_AMT_MAY05
Real number (ℝ)
| Distinct | 21010 |
|---|---|
| Distinct (%) | 70.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40358.33439 |
|---|---|
| Minimum | -81334 |
| Maximum | 927171 |
| Zeros | 3476 |
| Zeros (%) | 11.6% |
| Memory size | 234.1 KiB |
Quantile statistics
| Minimum | -81334 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1787 |
| median | 18130 |
| Q3 | 50247 |
| 95-th percentile | 165805.6 |
| Maximum | 927171 |
| Range | 1008505 |
| Interquartile range (IQR) | 48460 |
Descriptive statistics
| Standard deviation | 60817.13062 |
|---|---|
| Coefficient of variation (CV) | 1.506928657 |
| Kurtosis | 12.29453891 |
| Mean | 40358.33439 |
| Median Absolute Deviation (MAD) | 17714 |
| Skewness | 2.874925049 |
| Sum | 1209337490 |
| Variance | 3698723377 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 3476 | 11.6% | |
| 390 | 234 | 0.8% | |
| 780 | 94 | 0.3% | |
| 316 | 79 | 0.3% | |
| 326 | 62 | 0.2% | |
| 150 | 58 | 0.2% | |
| 396 | 46 | 0.2% | |
| 2400 | 39 | 0.1% | |
| 2500 | 37 | 0.1% | |
| 416 | 36 | 0.1% | |
| Other values (21000) | 25804 | 86.1% |
| Value | Count | Frequency (%) | |
| -81334 | 1 | < 0.1% | |
| -61372 | 1 | < 0.1% | |
| -53007 | 1 | < 0.1% | |
| -46627 | 1 | < 0.1% | |
| -37594 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 927171 | 1 | < 0.1% | |
| 823540 | 1 | < 0.1% | |
| 587067 | 1 | < 0.1% | |
| 551702 | 1 | < 0.1% | |
| 547880 | 1 | < 0.1% |
BILL_AMT_APR05
Real number (ℝ)
| Distinct | 20604 |
|---|---|
| Distinct (%) | 68.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38917.01228 |
|---|---|
| Minimum | -339603 |
| Maximum | 961664 |
| Zeros | 3990 |
| Zeros (%) | 13.3% |
| Memory size | 234.1 KiB |
Quantile statistics
| Minimum | -339603 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1262 |
| median | 17124 |
| Q3 | 49252 |
| 95-th percentile | 161932 |
| Maximum | 961664 |
| Range | 1301267 |
| Interquartile range (IQR) | 47990 |
Descriptive statistics
| Standard deviation | 59574.14774 |
|---|---|
| Coefficient of variation (CV) | 1.530799623 |
| Kurtosis | 12.25912611 |
| Mean | 38917.01228 |
| Median Absolute Deviation (MAD) | 16808 |
| Skewness | 2.845137169 |
| Sum | 1166148273 |
| Variance | 3549079079 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 3990 | 13.3% | |
| 390 | 206 | 0.7% | |
| 780 | 86 | 0.3% | |
| 150 | 78 | 0.3% | |
| 316 | 77 | 0.3% | |
| 326 | 56 | 0.2% | |
| 396 | 44 | 0.1% | |
| 416 | 36 | 0.1% | |
| -18 | 33 | 0.1% | |
| 2400 | 32 | 0.1% | |
| Other values (20594) | 25327 | 84.5% |
| Value | Count | Frequency (%) | |
| -339603 | 1 | < 0.1% | |
| -209051 | 1 | < 0.1% | |
| -150953 | 1 | < 0.1% | |
| -94625 | 1 | < 0.1% | |
| -73895 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 961664 | 1 | < 0.1% | |
| 699944 | 1 | < 0.1% | |
| 568638 | 1 | < 0.1% | |
| 527711 | 1 | < 0.1% | |
| 527566 | 1 | < 0.1% |
PAY_AMT_SEP05
Real number (ℝ≥0)
| Distinct | 7943 |
|---|---|
| Distinct (%) | 26.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5670.099316 |
|---|---|
| Minimum | 0 |
| Maximum | 873552 |
| Zeros | 5218 |
| Zeros (%) | 17.4% |
| Memory size | 234.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1000 |
| median | 2102 |
| Q3 | 5008 |
| 95-th percentile | 18447.2 |
| Maximum | 873552 |
| Range | 873552 |
| Interquartile range (IQR) | 4008 |
Descriptive statistics
| Standard deviation | 16571.84947 |
|---|---|
| Coefficient of variation (CV) | 2.92267358 |
| Kurtosis | 414.8548633 |
| Mean | 5670.099316 |
| Median Absolute Deviation (MAD) | 1929 |
| Skewness | 14.66159454 |
| Sum | 169904526 |
| Variance | 274626194.7 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 5218 | 17.4% | |
| 2000 | 1363 | 4.5% | |
| 3000 | 891 | 3.0% | |
| 5000 | 698 | 2.3% | |
| 1500 | 507 | 1.7% | |
| 4000 | 426 | 1.4% | |
| 10000 | 401 | 1.3% | |
| 1000 | 365 | 1.2% | |
| 2500 | 298 | 1.0% | |
| 6000 | 294 | 1.0% | |
| Other values (7933) | 19504 | 65.1% |
| Value | Count | Frequency (%) | |
| 0 | 5218 | 17.4% | |
| 1 | 9 | < 0.1% | |
| 2 | 14 | < 0.1% | |
| 3 | 15 | 0.1% | |
| 4 | 18 | 0.1% |
| Value | Count | Frequency (%) | |
| 873552 | 1 | < 0.1% | |
| 505000 | 1 | < 0.1% | |
| 493358 | 1 | < 0.1% | |
| 423903 | 1 | < 0.1% | |
| 405016 | 1 | < 0.1% |
PAY_AMT_AUG05
Real number (ℝ≥0)
| Distinct | 7899 |
|---|---|
| Distinct (%) | 26.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5927.98318 |
|---|---|
| Minimum | 0 |
| Maximum | 1684259 |
| Zeros | 5365 |
| Zeros (%) | 17.9% |
| Memory size | 234.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 850 |
| median | 2010 |
| Q3 | 5000 |
| 95-th percentile | 19030.8 |
| Maximum | 1684259 |
| Range | 1684259 |
| Interquartile range (IQR) | 4150 |
Descriptive statistics
| Standard deviation | 23053.45664 |
|---|---|
| Coefficient of variation (CV) | 3.888920724 |
| Kurtosis | 1639.924451 |
| Mean | 5927.98318 |
| Median Absolute Deviation (MAD) | 1990 |
| Skewness | 30.43861292 |
| Sum | 177632016 |
| Variance | 531461863.3 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 5365 | 17.9% | |
| 2000 | 1290 | 4.3% | |
| 3000 | 857 | 2.9% | |
| 5000 | 717 | 2.4% | |
| 1000 | 594 | 2.0% | |
| 1500 | 521 | 1.7% | |
| 4000 | 410 | 1.4% | |
| 10000 | 318 | 1.1% | |
| 6000 | 283 | 0.9% | |
| 2500 | 251 | 0.8% | |
| Other values (7889) | 19359 | 64.6% |
| Value | Count | Frequency (%) | |
| 0 | 5365 | 17.9% | |
| 1 | 15 | 0.1% | |
| 2 | 20 | 0.1% | |
| 3 | 18 | 0.1% | |
| 4 | 11 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1684259 | 1 | < 0.1% | |
| 1227082 | 1 | < 0.1% | |
| 1215471 | 1 | < 0.1% | |
| 1024516 | 1 | < 0.1% | |
| 580464 | 1 | < 0.1% |
PAY_AMT_JUL05
Real number (ℝ≥0)
| Distinct | 7518 |
|---|---|
| Distinct (%) | 25.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5231.688837 |
|---|---|
| Minimum | 0 |
| Maximum | 896040 |
| Zeros | 5937 |
| Zeros (%) | 19.8% |
| Memory size | 234.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 390 |
| median | 1804 |
| Q3 | 4512 |
| 95-th percentile | 17602.6 |
| Maximum | 896040 |
| Range | 896040 |
| Interquartile range (IQR) | 4122 |
Descriptive statistics
| Standard deviation | 17616.36112 |
|---|---|
| Coefficient of variation (CV) | 3.367241759 |
| Kurtosis | 563.7392771 |
| Mean | 5231.688837 |
| Median Absolute Deviation (MAD) | 1796 |
| Skewness | 17.2081766 |
| Sum | 156767556 |
| Variance | 310336179.3 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 5937 | 19.8% | |
| 2000 | 1285 | 4.3% | |
| 1000 | 1103 | 3.7% | |
| 3000 | 870 | 2.9% | |
| 5000 | 721 | 2.4% | |
| 1500 | 490 | 1.6% | |
| 4000 | 381 | 1.3% | |
| 10000 | 312 | 1.0% | |
| 1200 | 243 | 0.8% | |
| 6000 | 241 | 0.8% | |
| Other values (7508) | 18382 | 61.3% |
| Value | Count | Frequency (%) | |
| 0 | 5937 | 19.8% | |
| 1 | 13 | < 0.1% | |
| 2 | 19 | 0.1% | |
| 3 | 14 | < 0.1% | |
| 4 | 15 | 0.1% |
| Value | Count | Frequency (%) | |
| 896040 | 1 | < 0.1% | |
| 889043 | 1 | < 0.1% | |
| 508229 | 1 | < 0.1% | |
| 417588 | 1 | < 0.1% | |
| 400972 | 1 | < 0.1% |
PAY_AMT_JUN05
Real number (ℝ≥0)
| Distinct | 6937 |
|---|---|
| Distinct (%) | 23.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4831.617454 |
|---|---|
| Minimum | 0 |
| Maximum | 621000 |
| Zeros | 6377 |
| Zeros (%) | 21.3% |
| Memory size | 234.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 300 |
| median | 1500 |
| Q3 | 4016 |
| 95-th percentile | 16037 |
| Maximum | 621000 |
| Range | 621000 |
| Interquartile range (IQR) | 3716 |
Descriptive statistics
| Standard deviation | 15674.46454 |
|---|---|
| Coefficient of variation (CV) | 3.244144365 |
| Kurtosis | 277.0486932 |
| Mean | 4831.617454 |
| Median Absolute Deviation (MAD) | 1500 |
| Skewness | 12.89850649 |
| Sum | 144779417 |
| Variance | 245688838.5 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 6377 | 21.3% | |
| 1000 | 1394 | 4.7% | |
| 2000 | 1214 | 4.1% | |
| 3000 | 887 | 3.0% | |
| 5000 | 810 | 2.7% | |
| 1500 | 441 | 1.5% | |
| 4000 | 402 | 1.3% | |
| 10000 | 341 | 1.1% | |
| 2500 | 259 | 0.9% | |
| 500 | 258 | 0.9% | |
| Other values (6927) | 17582 | 58.7% |
| Value | Count | Frequency (%) | |
| 0 | 6377 | 21.3% | |
| 1 | 22 | 0.1% | |
| 2 | 22 | 0.1% | |
| 3 | 13 | < 0.1% | |
| 4 | 20 | 0.1% |
| Value | Count | Frequency (%) | |
| 621000 | 1 | < 0.1% | |
| 528897 | 1 | < 0.1% | |
| 497000 | 1 | < 0.1% | |
| 432130 | 1 | < 0.1% | |
| 400046 | 1 | < 0.1% |
PAY_AMT_MAY05
Real number (ℝ≥0)
| Distinct | 6897 |
|---|---|
| Distinct (%) | 23.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4804.897047 |
|---|---|
| Minimum | 0 |
| Maximum | 426529 |
| Zeros | 6672 |
| Zeros (%) | 22.3% |
| Memory size | 234.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 261 |
| median | 1500 |
| Q3 | 4042 |
| 95-th percentile | 16000 |
| Maximum | 426529 |
| Range | 426529 |
| Interquartile range (IQR) | 3781 |
Descriptive statistics
| Standard deviation | 15286.3723 |
|---|---|
| Coefficient of variation (CV) | 3.181415158 |
| Kurtosis | 179.8752095 |
| Mean | 4804.897047 |
| Median Absolute Deviation (MAD) | 1500 |
| Skewness | 11.12174174 |
| Sum | 143978740 |
| Variance | 233673178 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 6672 | 22.3% | |
| 1000 | 1340 | 4.5% | |
| 2000 | 1323 | 4.4% | |
| 3000 | 947 | 3.2% | |
| 5000 | 814 | 2.7% | |
| 1500 | 426 | 1.4% | |
| 4000 | 401 | 1.3% | |
| 10000 | 343 | 1.1% | |
| 500 | 250 | 0.8% | |
| 6000 | 247 | 0.8% | |
| Other values (6887) | 17202 | 57.4% |
| Value | Count | Frequency (%) | |
| 0 | 6672 | 22.3% | |
| 1 | 21 | 0.1% | |
| 2 | 13 | < 0.1% | |
| 3 | 13 | < 0.1% | |
| 4 | 12 | < 0.1% |
| Value | Count | Frequency (%) | |
| 426529 | 1 | < 0.1% | |
| 417990 | 1 | < 0.1% | |
| 388071 | 1 | < 0.1% | |
| 379267 | 1 | < 0.1% | |
| 332000 | 1 | < 0.1% |
PAY_AMT_APR05
Real number (ℝ≥0)
| Distinct | 6939 |
|---|---|
| Distinct (%) | 23.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5221.498014 |
|---|---|
| Minimum | 0 |
| Maximum | 528666 |
| Zeros | 7142 |
| Zeros (%) | 23.8% |
| Memory size | 234.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 131 |
| median | 1500 |
| Q3 | 4000 |
| 95-th percentile | 17384.4 |
| Maximum | 528666 |
| Range | 528666 |
| Interquartile range (IQR) | 3869 |
Descriptive statistics
| Standard deviation | 17786.97686 |
|---|---|
| Coefficient of variation (CV) | 3.406489252 |
| Kurtosis | 166.9817897 |
| Mean | 5221.498014 |
| Median Absolute Deviation (MAD) | 1500 |
| Skewness | 10.63509397 |
| Sum | 156462188 |
| Variance | 316376546 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 7142 | 23.8% | |
| 1000 | 1299 | 4.3% | |
| 2000 | 1295 | 4.3% | |
| 3000 | 914 | 3.1% | |
| 5000 | 808 | 2.7% | |
| 1500 | 439 | 1.5% | |
| 4000 | 411 | 1.4% | |
| 10000 | 356 | 1.2% | |
| 500 | 247 | 0.8% | |
| 6000 | 220 | 0.7% | |
| Other values (6929) | 16834 | 56.2% |
| Value | Count | Frequency (%) | |
| 0 | 7142 | 23.8% | |
| 1 | 20 | 0.1% | |
| 2 | 9 | < 0.1% | |
| 3 | 14 | < 0.1% | |
| 4 | 12 | < 0.1% |
| Value | Count | Frequency (%) | |
| 528666 | 1 | < 0.1% | |
| 527143 | 1 | < 0.1% | |
| 443001 | 1 | < 0.1% | |
| 422000 | 1 | < 0.1% | |
| 403500 | 1 | < 0.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.3 KiB |
| Value | Count | Frequency (%) | |
| 1 | 18091 | 60.4% | |
| 0 | 11874 | 39.6% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.3 KiB |
| Value | Count | Frequency (%) | |
| 0 | 18091 | 60.4% | |
| 1 | 11874 | 39.6% |
EDUCATION_graduate school
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.3 KiB |
| Value | Count | Frequency (%) | |
| 0 | 19402 | 64.7% | |
| 1 | 10563 | 35.3% |
EDUCATION_high school
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.3 KiB |
| Value | Count | Frequency (%) | |
| 0 | 25050 | 83.6% | |
| 1 | 4915 | 16.4% |
EDUCATION_other
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.3 KiB |
| Value | Count | Frequency (%) | |
| 0 | 29497 | 98.4% | |
| 1 | 468 | 1.6% |
EDUCATION_university
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.3 KiB |
| Value | Count | Frequency (%) | |
| 0 | 15946 | 53.2% | |
| 1 | 14019 | 46.8% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.3 KiB |
| Value | Count | Frequency (%) | |
| 0 | 23335 | 77.9% | |
| 1 | 6630 | 22.1% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
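The calculation described above can be sketched in a few lines of plain Python. This is an illustrative implementation, not the code the report generator uses; `pearson_r` is a hypothetical helper name. The sample values are the BILL_AMT_SEP05 and BILL_AMT_AUG05 entries of the ten first rows shown later in this report, so the result describes only that tiny sample, not the full dataset.

```python
import math

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # The 1/(n - 1) factors of the covariance and the two standard
    # deviations cancel, so plain sums of products are enough.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(ss_x * ss_y)

# BILL_AMT_SEP05 and BILL_AMT_AUG05 from the ten first rows of the sample.
bill_sep = [3913, 2682, 29239, 46990, 8617, 64400, 367965, 11876, 11285, 0]
bill_aug = [3102, 1725, 14027, 48233, 5670, 57069, 412023, 380, 14096, 0]

r = pearson_r(bill_sep, bill_aug)
print(r)

# Invariance under a change of location and scale of one variable.
r_rescaled = pearson_r([2 * a + 100 for a in bill_sep], bill_aug)
print(math.isclose(r, r_rescaled))
```

Rescaling one variable by a positive factor and shifting it leaves r unchanged, which is the invariance property mentioned above; negating one variable flips the sign of r.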
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
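As a rough sketch of that definition, ρ can be computed by ranking both variables and applying the Pearson formula to the ranks. The helper names `average_ranks`, `pearson_r` and `spearman_rho` are hypothetical, ties are assumed to receive the average of the ranks they span, and the cubic data is made up purely to show a nonlinear monotonic relationship.

```python
import math

def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the block of equal values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def pearson_r(x, y):
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(ss_x * ss_y)

def spearman_rho(x, y):
    """Pearson correlation of the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))

# A monotonic but nonlinear relationship: y = x cubed.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]

print(spearman_rho(x, y))  # perfectly monotonic, so rho is 1
print(pearson_r(x, y))     # below 1 because the relationship is not linear
```

Because only the ordering of the values matters, any strictly increasing transformation of either variable leaves ρ unchanged.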
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is then the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
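The pair-counting formula above corresponds to the variant usually called tau-a. A minimal sketch follows, with `kendall_tau_a` as a hypothetical helper name and made-up data. Note that statistical libraries often default to a tie-adjusted variant (tau-b), so their results can differ from this formula when the data contains ties.

```python
import math
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs."""
    concordant = 0
    discordant = 0
    for i, j in combinations(range(len(x)), 2):
        # A pair is concordant when x and y move in the same direction,
        # discordant when they move in opposite directions. Ties count as neither.
        direction = (x[i] - x[j]) * (y[i] - y[j])
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
    total_pairs = len(x) * (len(x) - 1) // 2
    return (concordant - discordant) / total_pairs

x = [1, 2, 3, 4, 5]
y = [3, 1, 2, 5, 4]

# 7 concordant and 3 discordant pairs out of 10: (7 - 3) / 10 = 0.4
tau = kendall_tau_a(x, y)
print(tau)
```

The double loop is quadratic in the number of observations, which is fine for illustration but slow for a dataset of this report's size.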
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available.
First rows
| | df_index | LIMIT_BAL | MARITAL_STATUS | AGE | REPAY_SEP05 | REPAY_AUG05 | REPAY_JUL05 | REPAY_JUN05 | REPAY_MAY05 | REPAY_APR05 | BILL_AMT_SEP05 | BILL_AMT_AUG05 | BILL_AMT_JUL05 | BILL_AMT_JUN05 | BILL_AMT_MAY05 | BILL_AMT_APR05 | PAY_AMT_SEP05 | PAY_AMT_AUG05 | PAY_AMT_JUL05 | PAY_AMT_JUN05 | PAY_AMT_MAY05 | PAY_AMT_APR05 | GENDER_female | GENDER_male | EDUCATION_graduate school | EDUCATION_high school | EDUCATION_other | EDUCATION_university | DEFAULT_default | DEFAULT_not default |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 20000 | 1 | 24 | 2 | 2 | -1 | -1 | -2 | -2 | 3913 | 3102 | 689 | 0 | 0 | 0 | 0 | 689 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 1 | 0 |
| 1 | 1 | 120000 | 2 | 26 | -1 | 2 | 0 | 0 | 0 | 2 | 2682 | 1725 | 2682 | 3272 | 3455 | 3261 | 0 | 1000 | 1000 | 1000 | 0 | 2000 | 1 | 0 | 0 | 0 | 0 | 1 | 1 | 0 |
| 2 | 2 | 90000 | 2 | 34 | 0 | 0 | 0 | 0 | 0 | 0 | 29239 | 14027 | 13559 | 14331 | 14948 | 15549 | 1518 | 1500 | 1000 | 1000 | 1000 | 5000 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 1 |
| 3 | 3 | 50000 | 1 | 37 | 0 | 0 | 0 | 0 | 0 | 0 | 46990 | 48233 | 49291 | 28314 | 28959 | 29547 | 2000 | 2019 | 1200 | 1100 | 1069 | 1000 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 1 |
| 4 | 4 | 50000 | 1 | 57 | -1 | 0 | -1 | 0 | 0 | 0 | 8617 | 5670 | 35835 | 20940 | 19146 | 19131 | 2000 | 36681 | 10000 | 9000 | 689 | 679 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 1 |
| 5 | 5 | 50000 | 2 | 37 | 0 | 0 | 0 | 0 | 0 | 0 | 64400 | 57069 | 57608 | 19394 | 19619 | 20024 | 2500 | 1815 | 657 | 1000 | 1000 | 800 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 1 |
| 6 | 6 | 500000 | 2 | 29 | 0 | 0 | 0 | 0 | 0 | 0 | 367965 | 412023 | 445007 | 542653 | 483003 | 473944 | 55000 | 40000 | 38000 | 20239 | 13750 | 13770 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 1 |
| 7 | 7 | 100000 | 2 | 23 | 0 | -1 | -1 | 0 | 0 | -1 | 11876 | 380 | 601 | 221 | -159 | 567 | 380 | 601 | 0 | 581 | 1687 | 1542 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 1 |
| 8 | 8 | 140000 | 1 | 28 | 0 | 0 | 2 | 0 | 0 | 0 | 11285 | 14096 | 12108 | 12211 | 11793 | 3719 | 3329 | 0 | 432 | 1000 | 1000 | 1000 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 1 |
| 9 | 9 | 20000 | 2 | 35 | -2 | -2 | -2 | -2 | -1 | -1 | 0 | 0 | 0 | 0 | 13007 | 13912 | 0 | 0 | 0 | 13007 | 1122 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 1 |
Last rows
| | df_index | LIMIT_BAL | MARITAL_STATUS | AGE | REPAY_SEP05 | REPAY_AUG05 | REPAY_JUL05 | REPAY_JUN05 | REPAY_MAY05 | REPAY_APR05 | BILL_AMT_SEP05 | BILL_AMT_AUG05 | BILL_AMT_JUL05 | BILL_AMT_JUN05 | BILL_AMT_MAY05 | BILL_AMT_APR05 | PAY_AMT_SEP05 | PAY_AMT_AUG05 | PAY_AMT_JUL05 | PAY_AMT_JUN05 | PAY_AMT_MAY05 | PAY_AMT_APR05 | GENDER_female | GENDER_male | EDUCATION_graduate school | EDUCATION_high school | EDUCATION_other | EDUCATION_university | DEFAULT_default | DEFAULT_not default |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 29955 | 29990 | 140000 | 1 | 41 | 0 | 0 | 0 | 0 | 0 | 0 | 138325 | 137142 | 139110 | 138262 | 49675 | 46121 | 6000 | 7000 | 4228 | 1505 | 2000 | 2000 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 1 |
| 29956 | 29991 | 210000 | 1 | 34 | 3 | 2 | 2 | 2 | 2 | 2 | 2500 | 2500 | 2500 | 2500 | 2500 | 2500 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 1 | 0 |
| 29957 | 29992 | 10000 | 1 | 43 | 0 | 0 | 0 | -2 | -2 | -2 | 8802 | 10400 | 0 | 0 | 0 | 0 | 2000 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 1 |
| 29958 | 29993 | 100000 | 2 | 38 | 0 | -1 | -1 | 0 | 0 | 0 | 3042 | 1427 | 102996 | 70626 | 69473 | 55004 | 2000 | 111784 | 4000 | 3000 | 2000 | 2000 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 1 |
| 29959 | 29994 | 80000 | 2 | 34 | 2 | 2 | 2 | 2 | 2 | 2 | 72557 | 77708 | 79384 | 77519 | 82607 | 81158 | 7000 | 3500 | 0 | 7000 | 0 | 4000 | 0 | 1 | 0 | 0 | 0 | 1 | 1 | 0 |
| 29960 | 29995 | 220000 | 1 | 39 | 0 | 0 | 0 | 0 | 0 | 0 | 188948 | 192815 | 208365 | 88004 | 31237 | 15980 | 8500 | 20000 | 5003 | 3047 | 5000 | 1000 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 1 |
| 29961 | 29996 | 150000 | 2 | 43 | -1 | -1 | -1 | -1 | 0 | 0 | 1683 | 1828 | 3502 | 8979 | 5190 | 0 | 1837 | 3526 | 8998 | 129 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 1 |
| 29962 | 29997 | 30000 | 2 | 37 | 4 | 3 | 2 | -1 | 0 | 0 | 3565 | 3356 | 2758 | 20878 | 20582 | 19357 | 0 | 0 | 22000 | 4200 | 2000 | 3100 | 0 | 1 | 0 | 0 | 0 | 1 | 1 | 0 |
| 29963 | 29998 | 80000 | 1 | 41 | 1 | -1 | 0 | 0 | 0 | -1 | -1645 | 78379 | 76304 | 52774 | 11855 | 48944 | 85900 | 3409 | 1178 | 1926 | 52964 | 1804 | 0 | 1 | 0 | 1 | 0 | 0 | 1 | 0 |
| 29964 | 29999 | 50000 | 1 | 46 | 0 | 0 | 0 | 0 | 0 | 0 | 47929 | 48905 | 49764 | 36535 | 32428 | 15313 | 2078 | 1800 | 1430 | 1000 | 1000 | 1000 | 0 | 1 | 0 | 0 | 0 | 1 | 1 | 0 |